Broadcast transmitting apparatus and broadcast transmitting method for providing an object-based audio, and broadcast playback apparatus and broadcast playback method

ABSTRACT

A broadcast transmitting apparatus and method, and a broadcast playback apparatus and method for providing an object-based audio by encoding and decoding a multichannel audio signal are provided. The broadcast transmitting apparatus may generate audio identification information used to determine whether the multichannel audio signal is an object-based audio signal. When the multichannel audio signal is determined to be the object-based audio signal, based on the audio identification information, the broadcast playback apparatus may control and output the multichannel audio signal for each channel.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application is a Continuation of U.S. patent application Ser. No.15/230,616 filed on Aug. 8, 2016, which is a Continuation of U.S. patentapplication Ser. No. 13/996,852 filed on Jun. 21, 2013, which is aNational Stage of International Application No. PCT/KR2011/009971, filedDec. 22, 2011 and published as WO 2012/087042 on Jun. 28, 2012, whichclaims the benefit under 35 U.S.C. § 119(a) Korean Patent ApplicationNo. 10-2010-0132665, filed on Dec. 22, 2010, in the Korean IntellectualProperty Office, the entire disclosures of which are incorporated hereinby reference for all purposes.

TECHNICAL FIELD

The present invention relates to a technology that may provide anobject-based audio by encoding and DECODING A MULTICHANNEL AUDIO SIGNAL.

BACKGROUND Art

An object-based audio service enables a user to listen to audio whilecontrolling various sound sources included in the audio. In general, inmusic, a voice of a singer, and a musical instrument, such as a guitar,a piano, a bass, and the like are mixed. Unlike a general audio service,in the object-based audio service, various musical instruments, a voiceof a singer, and the like may be independently encoded, or stored and/ortransmitted, rather than being mixed. When the object-based audioservice is used, a playback terminal may individually control only avoice of a singer, or each musical instrument.

However, since encoding and decoding may be independently performed foreach audio object in the object-based audio service, the object-basedaudio service may be incompatible with a broadcasting system, such as aDigital Television (DTV), a Digital Multimedia Broadcasting (DMB)system, a Digital Audio Broadcasting (DAB) system, and the like. Inother words, it is difficult to control audio signals for each object ina broadcast transmitting and playback apparatus, such as a DTV, a DMBsystem, a DAB system, and the like in the broadcasting system.

Accordingly, there is a desire for a technology that may provide anobject-based audio service in a broadcasting system, such as a DTV, aDMB system, a DAB system, and the like.

DISCLOSURE OF INVENTION Technical Goals

An aspect of the present invention provides a broadcast transmittingapparatus and method, and a broadcast playback apparatus and method thatmay provide an object-based audio service using audio identificationinformation in a broadcasting system, such as a Digital Television(DTV), a Digital Multimedia Broadcasting (DMB) system, a Digital AudioBroadcasting (DAB) system, and the like.

Technical Solutions

According to an aspect of the present invention, there is provided abroadcast transmitting apparatus, including an audio encoder to encode amultichannel audio signal, and an audio identification informationgenerator to generate audio identification information, the audioidentification information being used to determine whether themultichannel audio signal is an object-based audio signal.

The broadcast transmitting apparatus may further include a video encoderto encode a video signal.

The broadcast transmitting apparatus may further include a packetizingand multiplexing unit to packetize the audio identification information,mixing information, and the encoded multichannel audio signal, and tomultiplex the packetized audio identification information, thepacketized mixing information, and the packetized multichannel audiosignal.

When the multichannel audio signal is determined to be the object-basedaudio signal, the audio identification information generator maygenerate at least one piece of mixing information including a scheme ofmixing channels.

The audio identification information generator may generate the audioidentification information in the form of a descriptor.

According to another aspect of the present invention, there is provideda broadcast transmitting method including encoding a multichannel audiosignal, and generating audio identification information, the audioidentification information being used to determine whether themultichannel audio signal is an object-based audio signal.

The broadcast transmitting method may further include packetizing theaudio identification information, mixing information, and the encodedmultichannel audio signal, and multiplexing the packetized audioidentification information, the packetized mixing information, and thepacketized multichannel audio signal.

According to still another aspect of the present invention, there isprovided a broadcast playback apparatus, including a multichannel audiosignal determining unit to determine whether a multichannel audio signalis an object-based audio signal, based on audio identificationinformation extracted from a bitstream, and an audio decoder to decodean encoded multichannel audio signal.

When at least one piece of mixing information is input, a downmixingunit in the broadcast playback apparatus may downmix the multichannelaudio signal to a stereo audio signal, based on mixing information setas default information among the input mixing information.

When at least one piece of mixing information is input, the downmixingunit may downmix the multichannel audio signal to the stereo audiosignal, based on mixing information selected by a user operation fromamong the input mixing information.

The downmixing unit may downmix the multichannel audio signal to thestereo audio signal, based on mixing information input by a useroperation.

According to yet another aspect of the present invention, there isprovided a broadcast playback method, including determining whether amultichannel audio signal is an object-based audio signal, based onaudio identification information, and decoding an encoded multichannelaudio signal.

BRIEF DESCRIPTION OF DRAWINGS

FIG. 1 is a block diagram illustrating a configuration of a broadcasttransmitting apparatus according to an embodiment of the presentinvention.

FIG. 2 is a flowchart illustrating an operation of a broadcasttransmitting apparatus according to an embodiment of the presentinvention.

FIG. 3 is a block diagram illustrating a configuration of a broadcastplayback apparatus according to an embodiment of the present invention.

FIG. 4 is a flowchart illustrating an operation of a broadcast playbackapparatus according to an embodiment of the present invention.

BEST MODE FOR CARRYING OUT THE INVENTION

Reference will now be made in detail to embodiments of the presentinvention, examples of which are illustrated in the accompanyingdrawings, wherein like reference numerals refer to the like elementsthroughout. The embodiments are described below in order to explain thepresent invention by referring to the figures.

FIG. 1 is a block diagram illustrating a configuration of a broadcasttransmitting apparatus according to an embodiment of the presentinvention.

Referring to FIG. 1, a broadcast transmitting apparatus 100 may includea video encoder 110, an audio encoder 120, an audio identificationinformation generator 130, and a packetizing and multiplexing unit 140.

The video encoder 110 may encode a video signal using variouscompression algorithms, such as a Moving Picture Experts Group (MPEG),and the like.

The audio encoder 120 may encode a multichannel audio signal.

The audio identification information generator 130 may generate audioidentification information. The audio identification information may beused to determine whether the multichannel audio signal is anobject-based audio signal, or a surround audio signal.

The audio identification information may be generated in the form of adescriptor by the audio identification information generator 130. Theaudio identification information in the form of the descriptor may beinserted into a PMT (Program Map Tale) of an MPEG-2 Transport Stream(TS), and may be transmitted to a broadcast playback apparatus.

Additionally, the audio identification information generator 130 maygenerate at least one piece of mixing information including a scheme ofmixing channels. For example, when the multichannel audio signal is anobject-based audio signal, the audio identification informationgenerator 130 may generate audio identification information in which themixing information is inserted. Similarly, the audio identificationinformation in which the mixing information is inserted may be generatedin the form of a descriptor by the audio identification informationgenerator 130.

The audio identification information generator 130 may generate mixinginformation associated with a voice of a singer and the like, a firstmusical instrument, and a second musical instrument. For example, theaudio identification information generator 130 may generate first mixinginformation used to mix the voice, the first musical instrument, and thesecond musical instrument in a ratio of ‘1:1:1’, may generate secondmixing information used to mix the voice, the first musical instrument,and the second musical instrument in a ratio of ‘1:0:1’, and maygenerate third mixing information used to mix the voice, the firstmusical instrument, and the second musical instrument in a ratio of‘1:1:0’. Additionally, the audio identification information generator130 may generate audio identification information in which the firstmixing information through the third mixing information are inserted.Here, one of the first mixing information through the third mixinginformation may be set in advance as default information.

The packetizing and multiplexing unit 140 may individually packetize theencoded video signal, the encoded audio signal, and the audioidentification information. Additionally, the packetizing andmultiplexing unit 140 may generate a single bitstream by multiplexingthe packetized video signal, the packetized audio signal, and thepacketized audio identification information. The broadcast transmittingapparatus 100 may transmit the generated bitstream to the broadcastplayback apparatus.

The broadcast transmitting apparatus of FIG. 1 may store the encodedmultichannel audio signal, and the audio identification information in astorage medium, such as a Universal Serial Bus (USB), an external harddisc, a Blu-ray disc, DVD, and the like. Here, the audio identificationinformation may be stored in the form of a descriptor, in the storagemedium.

Additionally, the broadcast transmitting apparatus of FIG. 1 mayinclude, for example, a portable terminal, a home terminal, a vehicleterminal, and the like that may function as a Digital Television (DTV),a Digital Multimedia Broadcasting (DMB) system, a Digital AudioBroadcasting (DAB) system, and the like.

FIG. 2 is a flowchart illustrating an operation of a broadcasttransmitting apparatus according to an embodiment of the presentinvention.

In operation 210, the broadcast transmitting apparatus may encode avideo signal using a video compression algorithm, such as an MPEG-2,High Efficiency Video Coding (HEVC), and the like.

In operation 220, the broadcast transmitting apparatus may encode amultichannel audio signal using an audio compression algorithm, such asAudio Coding-3 (AC-3), Advanced Audio Coding (AAC), Bit-SlicedArithmetic Coding (BSAC), and the like.

In operation 230, the broadcast transmitting apparatus may generateaudio identification information. The audio identification informationmay be used to determine whether the multichannel audio signal is anobject-based audio signal, or a surround audio signal. The audioidentification information may be generated, for example, in the form ofa descriptor.

The broadcast transmitting apparatus may generate at least one piece ofmixing information including a scheme of mixing channels. For example,when the multichannel audio signal is an object-based audio signal, thebroadcast transmitting apparatus may generate audio identificationinformation in which the mixing information is inserted. Here, the audioidentification information in which the mixing information is insertedmay be generated in the form of a descriptor.

In operation 240, the broadcast transmitting apparatus may generate asingle bitstream, by individually packetizing the encoded video signal,the encoded audio signal, and the audio identification information, andby multiplexing the packetized video signal, the packetized audiosignal, and the packetized audio identification information.

An order of operations 210 through 230 of FIG. 2 may be changed. Inother words, the encoding of the video signal, the encoding of themultichannel audio signal, and the generating of the audioidentification information may be performed in a different order fromthat shown.

In FIG. 2, the broadcast transmitting apparatus may store, in a storagemedium, the encoded multichannel audio signal and the audioidentification information. Here, the audio identification informationmay include at least one piece of mixing information. Additionally, thestored audio identification information may be in the form of adescriptor.

FIG. 3 is a block diagram illustrating a configuration of a broadcastplayback apparatus according to an embodiment of the present invention.

Referring to FIG. 3, a broadcast playback apparatus 300 may include adepacketizing unit 310, a video decoder 320, an audio decoder 330, amultichannel audio signal determining unit 340, and a downmixing unit350.

The depacketizing unit 310 may demultiplex and depacketize a bitstream.

For example, the depacketizing unit 310 may demultiplex a bitstreamreceived from a broadcast transmitting apparatus, and may extract, fromthe bitstream, an encoded multichannel audio signal, an encoded videosignal, and audio identification information. Here, the audioidentification information may include at least one piece of mixinginformation.

Additionally, the depacketizing unit 310 may individually depacketizethe encoded multichannel audio signal, the encoded video signal, and theaudio identification information.

The video decoder 320 may decode the encoded video signal, based onvideo information included in signaling information.

The audio decoder 330 may decode the encoded multichannel audio signal,based on audio information included in the signaling information. Here,the signaling information may be included in the bitstream, and receivedfrom the broadcast transmitting apparatus.

The multichannel audio signal determining unit 340 may determine, basedon the audio identification information, whether the multichannel audiosignal is an object-based audio signal, or a surround audio signal.Here, the audio identification information may be in the form of adescriptor.

When the multichannel audio signal is determined to be the surroundaudio signal, the audio decoder 330 may output the decoded multichannelaudio signal, without a change.

When the multichannel audio signal is determined to be the object-basedaudio signal, the downmixing unit 350 may downmix the multichannel audiosignal to a stereo audio signal, based on at least one piece of mixinginformation. Here, the mixing information may be inserted in the audioidentification information, and received from the broadcast transmittingapparatus, or may be input by a user operation.

When the at least one piece of mixing information in the audioidentification information is used, the downmixing unit 350 may downmixthe multichannel audio signal to the stereo audio signal, based onmixing information set as default information among the at least onepiece of mixing information.

In an example in which audio identification information includes firstmixing information, second mixing information, and third mixinginformation, and in which the first mixing information is set in advanceas default information, the downmixing unit 350 may downmix themultichannel audio signal to the stereo audio signal, based on the firstmixing information. Here, the first mixing information may be used tomix a voice, a first musical instrument, and a second musical instrumentin a ratio of ‘1:1:1’, and the second mixing information may be used tomix the voice, the first musical instrument, and the second musicalinstrument in a ratio of ‘1:0:1.’ Additionally, the third mixinginformation may be used to mix the voice, the first musical instrument,and the second musical instrument in a ratio of ‘1:1:0.’

In another example in which the third mixing information is selected bya user operation from among the first mixing information through thethird mixing information, the downmixing unit 350 may downmix themultichannel audio signal to the stereo audio signal, based on the thirdmixing information.

In still another example in which audio identification informationincludes at least one piece of mixing information, and in which mixinginformation is input by a user operation, the downmixing unit 350 maydownmix the multichannel audio signal to the stereo audio signal, basedon the mixing information input by the user operation.

In yet another example in which a user desires to mix the voice, thefirst musical instrument, and the second musical instrument in aspecific ratio, regardless of the first mixing information through thethird mixing information, the user may input mixing information used tomix the voice, the first musical instrument, and the second musicalinstrument in a desired ratio, using an operating unit (not shown)included in the broadcast playback apparatus, or using a remotecontroller. In this example, when the user inputs mixing informationused to mix the voice, the first musical instrument, and the secondmusical instrument in a ratio of ‘1:0.5:0.5’, the downmixing unit 350may downmix the multichannel audio signal to the stereo audio signal,based on the ratio of ‘1:0.5:0.5’, to output the stereo audio signal.

FIG. 4 is a flowchart illustrating an operation of a broadcast playbackapparatus according to an example of the present invention.

In operation 410, the broadcast playback apparatus may demultiplex abitstream.

By demultiplexing the bitstream, the broadcast playback apparatus mayextract, from the bitstream, at least one of an encoded video signal, anencoded multichannel audio signal, and audio identification information.Here, the audio identification information may include at least onepiece of mixing information. The audio identification information may bein the form of a descriptor.

In operation 420, the broadcast playback apparatus may individuallydepacketize the encoded video signal, the encoded multichannel audiosignal, and the audio identification information. Here, the audioidentification information may include at least one piece of mixinginformation. The audio identification information may be in the form ofa descriptor.

In operation 430, the broadcast playback apparatus may decode theencoded video signal, based on video information included in signalinginformation. Here, the signaling information may include the videoinformation, audio information, and the like. Additionally, thesignaling information may be included in the bitstream, and receivedfrom a broadcast transmitting apparatus.

In operation 440, the broadcast playback apparatus may decode theencoded multichannel audio signal, based on the audio informationincluded in the signaling information.

In operation 450, the broadcast playback apparatus may analyze the audioidentification information.

Specifically, the broadcast playback apparatus may determine whether themultichannel audio signal is an object-based audio signal, or a surroundaudio signal, by analyzing the audio identification information.

When the multichannel audio signal is determined to be the object-basedaudio signal (460:YES), the broadcast playback apparatus may downmix themultichannel audio signal to a stereo audio signal, based on at leastone piece of mixing information in operation 470.

Specifically, the broadcast playback apparatus may downmix themultichannel audio signal to the stereo audio signal, based on the atleast one piece of mixing information included in the audioidentification information.

In an example, the downmixing unit 350 may downmix the multichannelaudio signal to the stereo audio signal, based on mixing information setas default information among the at least one piece of mixinginformation.

In another example, the broadcast playback apparatus may downmix themultichannel audio signal to the stereo audio signal, based on mixinginformation selected by a user operation from among the at least onepiece of mixing information in the audio identification information.

In still another example, the broadcast playback apparatus may downmixthe multichannel audio signal to the stereo audio signal, based onmixing information input by a user operation. For example, when a userdoes not desire to downmix the multichannel audio signal based on the atleast one piece of mixing information in the audio identificationinformation, the broadcast playback apparatus may receive mixinginformation from the user. Additionally, the broadcast playbackapparatus may downmix the multichannel audio signal to the stereo audiosignal, based on the mixing information received from the user, using anoperating unit (not shown) such as a key button, a touch panel, and thelike, or using a remote controller.

When the multichannel audio signal is determined to be the surroundaudio signal (460:NO), the broadcast playback apparatus may output thedecoded multichannel audio signal without a change, in operation 480.

An order of operations 430 through 450 of FIG. 6 may be changed. Inother words, the decoding of the video signal, the decoding of themultichannel audio signal, and the analyzing of the audio identificationinformation may be performed in a different order from that shown.

The decoding of the multichannel audio signal based on the bitstreamreceived from the broadcast transmitting apparatus, or the downmixing ofthe multichannel audio signal to the stereo audio signal has beendescribed above with reference to FIGS. 3 and 4. Additionally, thebroadcast playback apparatus may output the stereo audio signal, or thedecoded multichannel audio signal, based on the encoded multichannelaudio signal and the audio identification information that are stored inthe storage medium.

The encoding of the video signal and the audio signal, and thegenerating of the audio identification information have been describedwith reference to FIGS. 1 through 4, however, processing of the videosignal may be omitted. For example, when a radio is used, the broadcasttransmitting apparatus of FIGS. 1 and 2 may not require a video encoderto encode a video signal. Similarly, the broadcast playback apparatus ofFIGS. 3 and 4 may not require a video decoder to decode a video signal.

Although a few embodiments of the present invention have been shown anddescribed, the present invention is not limited to the describedembodiments. Instead, it would be appreciated by those skilled in theart that changes may be made to these embodiments without departing fromthe principles and spirit of the invention, the scope of which isdefined by the claims and their equivalents.

1. A broadcast playback apparatus, comprising: a multichannel audiosignal determining unit to determine whether an encoded multichannelaudio signal in a bitstream is an object-based audio signal, based onaudio identification information; and an audio decoder to decode theencoded multichannel audio signal, when encoded the multichannel audiosignal is the object-based audio signal.
 2. The broadcast playbackapparatus of claim 1, further comprising: a downmixing unit to downmixthe multichannel audio signal to a stereo audio signal, when the encodedmultichannel audio signal is determined to be the object-based audiosignal.
 3. The broadcast playback apparatus of claim 1, wherein thedownmixing unit downmixes the multichannel audio signal to the stereoaudio signal, based on mixing information.
 4. The broadcast playbackapparatus of claim 1, wherein the downmixing unit downmixes themultichannel audio signal to the stereo audio signal, based on (i)mixing information selected by a user operation from among the inputmixing information or (ii) mixing information input by a user operation.5. The broadcast playback apparatus of claim 1, wherein the audiodecoder output the decoded multichannel audio signal without a change,when the encoded multichannel audio signal is a surround audio signal.6. A broadcast playback method, comprising: determining whether anencoded multichannel audio signal in a bitstream is an object-basedaudio signal, based on audio identification information; and decodingthe encoded multichannel audio signal, when encoded the multichannelaudio signal is the object-based audio signal.
 7. The broadcast playbackmethod of claim 6, further comprising: downmixing the multichannel audiosignal to a stereo audio signal, when the encoded multichannel audiosignal is determined to be the object-based audio signal.
 8. Thebroadcast playback method of claim 6, wherein the downmixing comprises:downmixing the multichannel audio signal to the stereo audio signal,based on mixing information.
 9. The broadcast playback method of claim6, wherein the downmixing comprises: downmixing the multichannel audiosignal to the stereo audio signal, based on (i) mixing informationselected by a user operation from among the input mixing information, or(ii) mixing information input by a user operation.
 10. The broadcastplayback method of claim 6, wherein the decoding comprising: outputtingthe decoded multichannel audio signal without a change, when the encodedmultichannel audio signal is a surround audio signal.